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Abstract. Originating from Allen's Interval Algebra, composition-based reason- 
ing has been widely acknowledged as the most popular reasoning technique in 
qualitative spatial and temporal reasoning. Given a qualitative calculus (i.e. a re- 
lation model), the first thing we should do is to establish its composition table 
(CT). In the past three decades, such work is usually done manually. This is un- 
desirable and error-prone, given that the calculus may contain tens or hundreds 
of basic relations. Computing the correct CT has been identified by Tony Cohn 
as a challenge for computer scientists in 1995. This paper addresses this problem 
and introduces a semi-automatic method to compute the CT by randomly gen- 
erating triples of elements. For several important qualitative calculi, our method 
can establish the correct CT in a reasonable short time. This is illustrated by ap- 
plications to the Interval Algebra, the Region Connection Calculus RCC-8, the 
INDU calculus, and the Oriented Point Relation Algebras. Our method can also 
be used to generate CTs for customised qualitative calculi defined on restricted 
domains. 



1 Introduction 

Since Allen's seminal work of Interval Algebra (lA) fVl], qualitative calculi have been 
widely used to represent and reason about temporal and spatial knowledge. In the past 
decades, dozens of qualitative calculi have been proposed in the artificial intelligence 
area "Qualitative Spatial & Temporal Reasoning" and Geographic Information Science. 
Except lA, other well known binary qualitative calculi include the Point Algebra 1 3 1, the 
Region Connection Calculi RCC-5 and RCC-8 |4|, the INDU calculus |5 1, the Oriented 
Point Relation Algebras OVUA IS), and the Cardinal Direction Calculus (CDC) Q, 
etc. 

Relations in each particular qualitative calculus are used to represent temporal or 
spatial information at a certain granularity. For example. The Netherlands is west of 
Germany, The Alps partially overlaps Italy, I have today an appointment with my doctor 
followed by a check-up. 

Given a set of qualitative knowledge, new knowledge can be derived by using con- 
straint propagation. Consider an example in RCC-5. Given that The Alps partially over- 
laps Italy and Switzerland, and Italy is a proper part of the European Union (EU), and 
Switzerland is discrete from the EU, we may infer that The Alps partially overlaps 
the EU. The above inference can be obtained by using composition-based reasoning. 
The composition-based reasoning technique has been extensively used in qualitative 
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spatial and temporal reasoning, and, when combined with backtracking methods, has 
been shown to be complete in determining the consistency problem for several im- 
portant qualitative calculi, including lA, Point Algebra, Rectangle Algebra, RCC-5, 
and RCC-8. Moreover, qualitative constraint solvers have been developed to facilitate 
composition-based reasoning B8I9I . 

We here give a short introduction of the composition-based reasoning technique. 
Suppose is a qualitative calculus, and F — {vi%jVj}" is a constraint network 
over Ai. The composition-based reasoning technique uses a variant of the well-known 
Path Consistency Algorithm^which applies the following updating rule until the con- 
straint network becomes stable or an empty relation appears: 

lij ^ lij ^ lik °w Ikj, (1) 

where a /3 is the weak composition (cf [lO'lll) of two relations a, /3 in A^, namely 
the smallest relation in M which contains the usual composition of a and (3. Although 
for OPRA and some other calculi the composition-based reasoning is incomplete to 
decide the consistency problem, it remains a very efficient method to approximately 
solve the consistency problem. 

The weak composition in a qualitative calculus is determined by its weak com- 
position table (CT for short). Usually, the CT of M is obtained by manually checking 
the consistency of {xay,yPz,x^z} for each triple of basic relations {a,j,/3). When 
Ai contains dozens or even hundreds of basic relations, this consistency -based method 
is undesirable and error-prone. (12] first noticed this problem and identified it as a chal- 
lenge for computer scientists. 

This problem remains a challenge today. We here consider several examples. The 
Interval Algebra and the RCC-8 algebra contain, respectively, 13 and 8 basic relations. 
Their CTs were established manually. But if a calculus contains a hundred basic rela- 
tions, we need to determine the consistency of one million such basic networks. This 
is manually impossible. The OVTZA calculi and the CDC are large qualitative spatial 
calculi that have drawn increasing interests. OVTZAm contains Am x {4m + 1) (i.e. 
72, 156, 272 for m — 2,3, 4, respectively) basic relations IS], while the CDC contains 
218 basic relations |7|. Sometimes we need ingenious and special methods to establish 
CT for such a calculus. For the OVTZA calculi, the algorithm presented in the original 
paper H contains gaps and errors. Later, im presented the second algorithm, which 
is quite lengthy and cumbersome. Another simple algorithm has also been proposed 
recently |14|. Given the huge number of basic relations of OVTZAm, the validity of 
these algorithms need further verification. As for the CDC, |7| first studied the weak 
composition. Later, fl5\ noticed errors in Goyal's method and gave a new algorithm 
to compute the weak composition. Unfortunately, in several cases, their algorithm does 
not generate the correct weak composition (see i 16|). 

In this paper, we respond to this challenge and propose a semi-automatic approach 
to generate CT for general qualitative calculi. In the remainder of this paper, we first 
recall basic notions and results about qualitative calculi and weak composition tables in 
Section 2, and then apply our method to lA, INDU, RCC-8, and OVTZAi and OVTZA2 

' The notion of Path Consistency is usually defined for constraints on finite domains, and not 
always appropriate for general qualitative constraints, which are defined on infinite domains. 
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in Section 3. An analysis of the strength and weakness of our approach is given in 
Section 4. Section 5 then concludes the paper. 

2 Preliminaries 

In this section we recall the notions of qualitative calculi and their weak composition 
tables. Interested readers may consult e.g. 1 1 7 1 1] for more information. 

Definition 1. Suppose U is a universe of spatial or temporal entities, and B is a set 
of jointly exhaustive and pairwise disjoint (JEPD) binary relations on U. We call the 
Boolean algebra generated by B a qualitative calculus, and call relations in B the basic 
relations of this qualitative calculus. 

We consider a simple example. 

Example 1 (Point Algebra). Suppose [/ = M. For two points a, b in U, we have either 
a < 6, or a = 6, or a > 6. Let B ~ {<, >}• Then S is a JEPD set of relations on U . 
We call the Boolean Algebra generated by B the Point Algebra. 

We next recall the central notion of weak composition. 

Definition 2. Suppose M is a qualitative calculus on U, and B is the set of its basic 
relations. The weak composition of two basic relations a and j3 in Ai, denoted as a 
j3, is defined as the smallest relation in M which contains ao (3, the usual composition 
of a and /3. 

Usually, a qualitative calculus has a finite set of relations. The weak composition 
operation of M can be summarised in an n x n table, where n is the cardinality of B, and 
the cell specified by a and /3 contains all basic relations 7 in S such that 7000/? ^ 0. 
The CT of the Point Algebra is given in Table [T] 

Table 1. The CT of the Point Algebra, where * is the universal relation 
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Definition 3. Suppose Ai is a qualitative calculus on U with basic relation set B. For 
basic relations a, /3, 7, we call (a, 7, /3) a composition triad, or c-triad, if^ '~= a 

We can determine if a 3-tuple is a c-triad as follows (cf. Fig.[T]). 

Proposition 1. A 3-tuple {a, 7, /3) of basic relations in M is a c-triad iff^C]aoj3 ^ 0, 
which is equivalent to saying that the basic constraint network 

{xay, yf3z, xjz} (2) 

is consistent, i.e. it has a solution in U. 

To compute the weak composition of a and /?, one straightforward method is to find 
all basic relations 7 such that {a, 7, /3) is a c-triad. 
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X ' z 

Fig. 1. A c-triad (a, 7, /3) 

3 A General Method for Computing CT 

In this section, we propose a general approach to compute the composition table of a 
qualitative calculus with domain U and basic relation set B. The approach is based 
on the observation that each triple of objects in U derives a valid c-triad. 

Proposition 2. Suppose a, b, c are three objects in U. Then {p{a, 6), p{a, c), p{b, c)) is 
a c-triad, where p{x, y) is the basic relation in M that relates x to y. 

It is clear that six (different or not) c-triads can be generated if we consider all permu- 
tations of a, 6, c. 

To compute the CT of A^, the idea is to choose randomly a triple of elements in 
U and then compute and record the c-triads related to these objects in a dynamic ta- 
ble. Continuing in this way, we will get more and more c-triads until the dynamic 
table becomes stable after sufficient large loops. The basic algorithm is given in Al- 
gorithm [T] where _D is a subdomain of U, 9 decides when the procedure terminates. 
Triad records the number of c-triads obtained when the procedure terminates, and 
LastFound records the time when the last triad is first recorded. For a calculus with 
unknown CT, the condition may be assigned with the form Loop < 1, 000, 000 (i.e., 
the algorithm loops one milHon times), or Loop < LastFound + 100, 000 (i.e., until 
no new c-triad is found in the last one hundred thousand loops), or their conjunction. If 
the CT is known and we want to double-check it, then the boundary condition could be 
set to Triad < TV to save time, where N is the number of c-triads of the calculus. 

We make further explanations here. 

Suppose is a qualitative calculus on U . Recall U is often an infinite set. We need 
first to decide a finite subdomain D of U, as computers only deal with numbers with 
finite precision. Once D is chosen, we run the loop, say, one million times. Therefore, 
one million instances of triples of elements in D are generated. We then record all 
computed c-triads in a dynamic table. It is reasonable to claim that the table is stable if 
no new entry has been recorded after a long time (e.g. as long as the time has past to 
get all recorded c-triads). Because D is finite. Algorithm [T] will generate a stable table 
after a sufficient large number of iterations. 

We observe that a finite subdomain D may restrict the possible c-triads if it is se- 
lected inappropriately. We introduce a notion to characterise the appropriateness of a 
subdomain. 

Definition 4. Suppose M is a qualitative calculus defined on the universe U. A nonempty 
subset D ofU is called a 3-complete subdomain of A4 if each consistent basic network 
as specified in Eq. ^has a solution in D. 
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Algorithm 1: Computing the Composition Table of M. 
Input: A subdomain DofM, and a boundary condition ^ related to M 
Output: The Composition Table CT of M 

Initialise CT; 
Loop ^ 0; 
Triad 0; 
LastFound 4- 0; 
while 'P do 

Loop Loop + 1; 

Generate triple of objects [a, b, c) G randomly; 

a -f- the basic relation between a and 6; 

^ •<— the basic relation between b and c; 

7 -s— the basic relation between a and c; 

a' <— the basic relation between b and a; 

/5' ■<— the basic relation between c and 6; 

7' ^ the basic relation between c and a; 

for (r,6-,t) G {(a,7,/3), (a',/3,7), (7,a,/3'), {/3,a',7'), {/9',7',«'). (7',/3',«)} 
do 

if (r, s, t) is not in CT then 
Record triad (r, s, t) to CT; 
Triad Triad + 1; 
LastFound Loop; 

end 

end 

end 

return CT. 



If D is a 3-complete subdomain, then, for each c-triad (a, 7, /3), there are a, b,c in D 
such that (a, fe) G a, (6, c) G /3, and (a, c) G 7. Therefore, to determine the CT of A4, 
we need only consider instances of triples in D. 

Note that no matter whether the subdomain D is 3-complete, the algorithm always 
generates 'valid' triads, in the sense that any 3-tuple {a,j,/3) in the CT generated is 
indeed a c-triad of the calculus. However, the algorithm only converges to the correct 
CT when the subdomain D is 3-complete. 

It is of course important questions to find 3-complete subdomains or to decide if a 
particular subdomain is 3-complete. However, it seems that there is no general answer 
for arbitrary quaUtative calculi, since the questions are closely related to the semantics 
of the calculi. For a particular calculus, e.g. lA, this can be verified by formal analysis. 
Note that a superset of a 3-complete subdomain is also 3-complete. To make sure a 
chosen subdomain D is 3-complete, we often apply the algorithm on several of its 
supersets at the same time. If the same number is generated for all subdomains, we tend 
to beheve that D is 3-complete and the generated table is the CT of M. Note a formal 
proof is necessary to guarantee the 3-completeness of D. 

Even if a CT of Ai has been somehow obtained, our method can be used to verify its 
correctness. Double-checking is necessary since computing the CT is error-prone (see 
the last paragraph of page 1). If there is a c-triad that does not appear in the previously 
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given table, something must be wrong with the table, because the c-triads computed 
by Algorithm [T| are always valid. It is also possible that the algorithm terminates with a 
fragment of given composition table. We then can make theoretical analysis to see if the 
missing c-triads are caused by the incompleteness of the subdomain. If so, we modify 
the subdomain and run the algorithm again, otherwise, the missing c-triads are Ukely to 
be invalid c-triads. 

Another thing we should keep in mind is how to generate a triple of elements 
(a, b, c) from D. Note that if D is small (e.g. in the cases of PA and lA), we can gener- 
ate all possible triples. If D contains more than 1000 elements, then it will be necessary 
to generate the triples randomly as there are over a billion different triples. The distri- 
bution over D may affect the efficiency of the algorithm. Assuming that we have very 
limited knowledge of the calculus A^, it is natural to take a, b and c independently with 
respect to the uniform distribution. We note that the better we understand the calculus, 
the more appropriate the distribution we may choose. 

To increase the efficiency of the algorithm, we sometimes use the algebraic prop- 
erties of the calculus. For example, if the identity relation id is a basic relation, then 
hy a id = a = id a and id C a q;~, we need not compute the c-triads 
involving id, where a~ is the converse of a. This is to say, the algorithm only needs to 
generate pairwise different elements. As another example, suppose that the calculus is 
closed under converse, i.e. the converse of a basic relation is still a basic relation. Then 
in Algorithm [T] we need only compute a, f3, 7. The other relations and c-triads can be 
obtained by replacing a' , (3' , 7' in the algorithm by, respectively, a~, 7~. Similar 
results have been reported in ITSll . 

In the following we examine three important examples. All experiments were con- 
ducted on a 3.16 GHZ Intel Core 2 Duo CPU with 3.25 GB RAM running Windows XP 
Note the results rely on the random number generator. As our aim is to show the fea- 
sibility of the algorithm rather than investigating the efficiency issues, we only provide 
one group of the results and do not make any statistical analysis. 

3.1 The Interval Algebra and the INDU Calculus 

We start with the best known qualitative calculus. 

Example 2 (Interval Algebra). Let U be the set of closed intervals on the real line. 
Thirteen binary relations between two intervals x = [x^ ,x^] and y = [y , y^] are 
defined in Table |2] The Interval Algebra |2| is the Boolean algebra generated by these 
thirteen JEPD relations. 

The CT for lA has been computed in 1983 in Allen's famous work. When applying 
Algorithm [T] to lA, we do not consider all intervals. Instead, we restrict the domain to 
the set of all intervals contained in [0, M) that have integer nodes 

Dm = {[P, g e Z, < p < 9 < M}, 

and use uniform distribution to choose random intervals. It is easy to see that the size of 
the domain is M{M — l)/2. Note that to converge fast and generate all entries, we need 
to choose an appropriate M. If AI is too small, then it is possible that some c-triads can 
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Table 2. Basic lA relations and their converses, where x = 
intervals. 



[x = [y are two 



Relation 


Symbol 


Converse 


Meaning 
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= y+ 
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Table 3. Implementation for lA, where TRIAD is the number of c-triads recorded by running the 
algorithm on Dm for M = 4 to M — 20, LastFound is the loop when the last triad is first 
recorded 
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4 


5 
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10 


11 


12 


Triad 


139 


319 


409 


409 


409 


409 


409 


409 


409 


LastFound 


92 


629 


1501 


878 


2111 


3517 


728 


697 


932 



M 


13 


14 


15 


16 


17 


18 


19 


20 


Triad 


409 


409 


409 


409 


409 


409 


409 


409 


LastFound 


11212 


20249 


7335 


4343 


3632 


17862 


5533 


43875 



not be instantiated. On the other hand, if M is too big, relations that require one or more 
exact matches (such as m in lA and in the INDU calculus to be introduced in the 
next example) is very hard to generate, i.e. the probability of generating such an instance 
is very small. For a new qualitative calculus, there is no general rules for choosing M. 
Usually, pilot experiments are necessary to better understand the characteristics of the 
calculus. 

Table |3] summarises the results for M = A io M — 20. In the experiment, we 
generate one million instances of triples of elements for each domain D^. In all cases 
the dynamic table becomes stable in less than 50,000 loops. When the table becomes 
stable, the numbers of triads computed are not always the correct one (that is 409). This 
is mainly because the domain is too small. For AI bigger than or equal to six, we always 
get the correct number of triads|^The loops needed (i.e. LastFound) vary from less 
than a thousand to more than 43 thousand (see Table [3]l. In general, the smaller the 
domain is the more efficient the algorithm is. 

Example 3 (INDU calculus). The INDU calculus ([5) is a refinement of lA. For each 
pair of intervals a, b, INDU allows us to compare the durations of a, b. This means, 
some lA relations may be split into three sub-relations. For example, b is split into 
three relations b^,b~,b^. Similar situations apply to m.O,oi,mi, and bi. The other 
seven relations have no proper sub-relations. Therefore, INDU has 25 basic relations. 

INDU is quite unlike lA. For example, it is not closed under composition, and a 
path-consistent basic network is not necessarily consistent |fT9l. 

^ The 3-completeness of Dc, follows from the fact that each consistent lA network involving 
three variables has a solution in De- 
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Table 4. Implementation for INDU, where TRIAD is tlie number of c-triads recorded by running 
the algorithm on Dm for M = 6 to AI = 20, LastFound is the loop when the last triad is first 
recorded 



M 


6 


7 


8 


9 


10 


11 


12 


13 


Triad 


1045 


1531 


1819 


1987 


2041 


2053 


2053 


2053 


LastFound 


3766 


5753 


10417 


35201 


35891 


25031 


12512 


27728 



M 


14 


15 


16 


17 


18 


19 


20 


Triad 


2053 


2053 


2053 


2053 


2053 


2053 


2053 


LastFound 


17223 


24578 


14758 


22491 


29034 


49693 


19772 



Applying our algorithm to INDU, we use the same subdomain Dm as for lA. From 
Table |4] we can see that is no longer 3-complete: more than 1000 c-triads do not 
appear in the stable table. The table becomes complete in Du, which has 2053 c-triads. 
The 3-completeness of Du is confirmed by the following proposition. 

Proposition 3. The INDU calculus has at most 2053 c-triads. 

Proof (Sketch). For any three INDU relations a*\ f3*^ ,'-f*^ (*i,*2,*3 e {<, =, >}), 
it is easy to see that {a*'^ , 7*^ , f3*^ ) is a valid c-triad of INDU only if (a, 7, /?) is a valid 
c-triad of lA and (*i,*2,*3) is a valid c-triad of PA. We note that for lA relations in 
{d,s,f, eq,si,fi,di}, only d^,s<,f^,eq=,si^,fi^,di^ are valid INDU relations. It is 
routine to check that there are only 2053 triples of INDU relations that satisfy the above 
two constraints. We recall that lA has 409 c-triads (see Table |3]l, and PA has 13 c-triads 
(see Table [1]). □ 

Since 2053 valid c-triads are recorded by running the algorithm on Du for INDU, we 
know INDU has precisely 2053 c-triads, and Du is 3-complete for INDU. Moreover, 
we have that , 7"^^ , is a valid c-triad of INDU if and only if {a, 7, /3) is a valid 
c-triad of lA and (^i , *2 1 *3) is a valid c-triad of PA. 

It seems that this is the first time that the CT of INDU has been computed. 



3.2 The Oriented Point Relation Algebra 

In the OVTZAm calculus, where m is a parameter characterizing its granularity, each 
object is represented as an oriented point (o-point for short) in the plane. Each o-point 
has an orientation. Based on which, 2m — 1 other directions are introduced according 
to the chosen granularity. Any other o-point is located on either a ray or in a section 
between two consecutive rays. Each of these rays and sections is assigned an integer 
from to 4m — 1. The relative directional information of two o-points A, B is uniquely 
encoded in a pair of integer numbers (s, t), where s is the ray or section of A in which 
B is located, and t is the ray or section of B in which A is located. Such a relation is 
also written as Am^\B. In the case that the locations of A and B coincide, the relation 
between A and B is written as ,„Z^_B, where s is the ray or section of A in which 
the orientation of B is located. Therefore, there are ^m{Am + 1) basic relations in 
OVRAm- 



(a) (b) 
Fig. 2. Two o-points A, B with the OVTiA2 relation (a) 2^7 and (b) 2/1- 



There are two natural ways to represent o-points. One uses the Cartesian coordinate 
system, the other use polar coordinate system. We next show the choice of coordinate 
system will significantly affect the experimental results, which are compared with that 
ofUFl- 

In the Cartesian coordinate system, an o-point P is represented by its coordination 
(z, y) and its orientation 0. 

Definition 5. Let Mi and M2 be two positive integers. We define a Cartesian based 
subdomain of OVTZAm as 

D,{Mi,M2) - {((x, y),4>) ■.x,ye [-Mi, Ah] n Z, ,^ e <?>mJ, 

where <Pm2 = {0, 2tt/M2, • • • , (M2 - 1)/M2 x 27r}. 



Table 5. Implementation for OVTZAi on a Cartesian coordinated domain Dc{Mi, M2), where 
Triad is the number of c-triads computed by running the algorithm on Dc{Mi, M2) for M\ = 
6; LastFound is the loop when the last triad is first recorded for M2 = 8 (in the 2nd last row) 
and M2 = 16 (in the last row) 



M2 



10 12 16 



Triad 148 1024 1056 1024 1024 1440 1024 1408 1440 



Ml 


2 4 6 8 10 


LastFound (M2 
LastFound (M2 


= 8) 
= 16) 


8082 35932 411893 881787 > 1000000 
18618 295936 174490 > 1000000 > 1000000 



Our experimental results show that, for OVTZAi, the algorithm converges and gen- 
erates the correct CT for subdomains with Mi > 2 and M2 G {8, 16}. That is, the 
smallest 3-complete subdomain is -De (2, 8). 

For 0'PTZA2, however, the algorithm does not compute the desired CT in ten mil- 
lion loops. Actually, it is impossible to compute the desired CT if we use Cartesian co- 
ordination. Consider the following example. Suppose A, B, C are three o-points, such 
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that AABC is an acute triangle, and the orientation of A is the same as the direction 
from A to B, the orientations of B and C are similar. In this configuration, we have 
A2Z.I1B, B2ZIC, and A2ZjC. This configuration, however, cannot be realised in a 
Cartesian based subdomainQ 

Table 6. Implementation for 0'PTZA2 on a Cartesian coordinated domain Dc{M\, M2), where 
Triad is the number of c-triads computed by running Algorithm [T] ten million times on 

DciMi,M2) for Ml = 6 



Ma 


2 4 6 8 10 12 16 


Triad 


2704 2704 21792 23616 21792 21792 35232 



Based on the above observation, we turn to the polar coordinated representation. In 
the polar coordinate system, an o-point P is represented by its polar coordination {p, 9) 
and its orientation (p. 

Definition 6. Let AIi and M2 be two positive integers. We define a polar coordinated 
subdomain of OVTZAm as 

Dp{Mi,M2) = {{{p, 0),(j)) : p e [0, Ml] n Z, 0, e ^M,}, 

where ^Mo. = {0, 27r/M2, • • • , (M2 - 1)/M2 x 2tt}. 

As in Cartesian based subdomains, the parameter M2 determines if a domain is 
complete, while Mi determines the efficiency of the algorithm. For OVTZAi, we have 
D{Mi,M2) is a 3-complete subdomain if Mi > 2 and M2 = 6,8, 10, 12, 16 (see 
Table |7]i; for 0'PnA2, we have L»(Mi,M2) is 3-complete if Mi > 4 and M2 = 
6, 10, 12, 16 (see Table[8]). 



Table 7. Implementation for OVTZAi on a polar coordinated domain Dp{Mi, M2), where 
Triad is the number of c-triads computed by running the algorithm on Dp{Mi, M2) for 
Ml = 6; LastFound is the loop when the last triad is first recorded for M2 = 8 (in the 
2nd last row) and M2 = 16 (in the last row) 



Ma 2 3 



10 12 16 



Triad 52 1024 1032 1408 1440 1440 1440 1440 1440 



Ml 


4 


6 


8 10 


16 


LastFound (M2 -- 


= 8) 


3072 


4868 


22327 10363 


38843 


LastFound (Ma = 


16) 


26219 


45831 


121542 71205 


146536 



3.3 The Region Connection Calculus 

Our algorithm works very well for simple objects like points and intervals. We next 
consider a region-based topological calculus RCC-8. It is worth noting that an auto- 

^ The proof of this statement is much involved and omitted in this paper. 
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Table 8. Implementation for 0'PTZA2 on a polar coordinated domain Dp{Mi, M2), where 
Triad is the number of c-triads computed by running the algorithm on Dp (Mi, M2) for A/i — 6 



M2 


2 3 4 6 8 10 12 16 


Triad 


400 24672 2128 36256 23616 36256 36256 36256 



mated derivation of the composition table was reported in ll20ll for a similar calculus 
(the 9-intersection model). 

Example 4 (RCC-8 algebra). Let U be the set of bounded plane regions (i.e. nonempty 
regular closed sets in the plane). Five binary relations are defined in Table|9] The RCC-8 
algebra |4| is the Boolean algebra generated by these five relations, the identity relation 
EQ, and the converses of TPP and NTPP. 



Table 9. A topological interpretation of basic RCC-8 relations in the plane, where a, b are two 
bounded plane regions, and a° , b° are the interiors of a, 6, respectively. 



Relation 


Meaning 


DC 


a n 6 = 


EC 


a n & 7^ 0, a° n 6° = 


PO 


a g b, fe g a, a° n 6° / 


TPP 


a C b,a (;t b° 


NTPP 


acb° 



Table 10. Implementation for RCC-8, where TRIAD is the number of c-triads computed by run- 
ning the algorithm on Dm using rectangles, LastFound is the loop when the last triad is first 
recorded 



M 


4 


5 


6 


8 


10 


15 


20 


Triad 


114 


177 


192 


192 


192 


192 


192 


LastFound 


14776 


6513 2332646 


56067 


198255 


261729 


1521173 



Plane regions are much more compUcated to represent than intervals or o-points. In 
most cases they are approximated by polygons or digital regions (i.e., a subset of Z^). 
Furthermore, it is natural to take a shot on simple objects at the beginning, since they are 
easy to deal with and important in applications. For RCC-8, we make experiments over 
two subdomains: rectangles and disks. The experiments show that these subdomains 
are good enough for our purpose, but when necessary, we could also consider general 
polygons or bounded digital regions. 

We first consider subdomains whose elements are rectangles sides of which are 
parallel to the two axes. We introduce one parameter M, and require the four nodes 
be points in [0, M) x [0, M) n Z^. The complete RCC-8 CT has 193 table entries. 
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Since EQ o EQ = EQ, we know (EQ, EQ, EQ) is a c-triad. The other 192 c-triads 
can be confirmed using our algorithm. In Table 10 we show the results of running the 
algorithm 10 million times and require M vary from 4 to 20. We can see from the table 
that Dm is a 3-complete subdomain only if M > 6. 



Table 11. Implementation for RCC-8, where TRIAD is the number of c-triads computed by run- 
ning the algorithm on Dm using disks, LASXFotJND is the loop when the last triad is first 
recorded 



M 


4 5 


6 


8 


10 


15 


20 


Triad 


188 192 


192 


192 


192 


192 


192 


LastFound 


1759 8913 


9489 


25955 


113757 


942914 


2961628 



We next consider subdomains consisting of disks (see Table [TTJi. We introduce one 
parameter M, and require x,y ^ [0, M] OZ, r €i [1, M] n Z, where {x, y) and r are, 
respectively, the centre and the radius of the closed disk B{{x, y),r)).ln this case, M = 
5 is good enough to generate all c-triads. We notice that the number of loops needed 
(i.e. LastFound) increases quickly as AI increases. For example, when M — 20, the 
dynamic table becomes stable after nearly 3 million loops. This is mainly due to that an 
instance of the c-triad (NTPP, NTPP, NTPP) is very hard to generate. The 'hard' 
c-triad is, however, easy to prove. 



4 Further Discussions 

In the last section, we have applied our algorithm to generate the CTs of lA, INDU, 
OVTZAi, OVTIA2, and RCC-8. In this section, we discuss the advantages and disad- 
vantages of our method. The algorithm works very well for simple objects like points, 
intervals, rectangles, and disks, especially in a small subdomain. For a qualitative cal- 
culus with less than 100 basic relations, it can compute the CT in a few minutes. 

We also considered larger calculi. The Oriented Point Relation Algebras OVTlA'i 
and OVTZAi have, respectively, 156 and 272 basic relations. Applying our algorithm 
to an appropriate polar coordinated subdomain Dp{Mi, M2), 261,576 and 1,082,752 
c-triads, respectively, have been found in a few hours, which coincide with those com- 
puted in (T4\. This implies that the corresponding subdomains are 3-complete. 

For calculi defined over regions, the main obstacle of using our approach is the cost 
of generating random regions. For RCC-8, we circumvent this obstacle by taking rect- 
angles and disks. But this circumvention does not work for the Cardinal Direction Cal- 
culus (CDC) Ul, as the shape of the region matters in this calculus. The CDC contains 
218 basic relations. We run our algorithm for the CDC on the subdomain containing 
digital regions in [0,5] x [0, using normal distribution. The result is not ideal. After 
one day, we have computed about 60% of the total 2.2 million c-triads of the CDC. 
Improvements will be made later, adopting more appropriate or heuristic distribution. 



The 3-completeness of this subdomain is confirmed by results reported in 1161 . 



13 



In many applications of qualitative calculi, the objects used are often restricted. 
Take OVTZA2 as example. In many real world applications, e.g. the Interstate Highway 
System of the USA, oriented objects are all taken from a underlying graph. In these 
cases, each o-point has only a few possible directions. To support reasoning with this 
domain, we had better have a customised 0'PTZA2 calculus, together with a customised 
CT. Our algorithm works perfect to this end. For example, consider the restriction of 
OVTZA2 calculus on 1? . Each o-point in this calculus has integer coordinations and 
has one of the four orientations from {0,7r/2,7r, 37r/2}. Using our algorithm, the CT 
for this customised calculus has been generated in a few minutes. Experiment result 
shows that this customised calculus has 2704 c-triads. 

Even for the well-known Interval Algebra, our algorithm suggests a new viewpoint 
for efficient reasoning. We note that, based on our method, we can easily compute the 
probability of each basic relation in the weak composition of any two basic lA relations. 
This may be used in approximate temporal reasoning, especially when the application 
domain has a different structure than the universe of lA. Work towards this direction 
will be reported in another paper. 

5 Conclusion 

In this paper, we introduced a general and simple semi-automatic method for comput- 
ing the composition tables of qualitative calculi. The described method is a very natural 
approach, and similar idea was used to derive composition tables for an elaboration of 
RCC with convexity EH . and for a ternary directional calculus ll22ll . The table com- 
puted in fTT\ was acknowledged there as incomplete. The table computed in ll22l is 
complete, but its completeness was guaranteed by manually checking all geometric 
configurations that satisfy the table. Except these two works, very little attention has 
been given to this natural approach in the literature on composition tables. We think a 
systematic examination is necessary to discover both the strong and weak points of this 
approach. 

We implemented the basic algorithm for several well-known qualitative calculi, in- 
cluding the Interval Algebra, INDU, OVUAm for to = 1 - 4, and RCC-8. Our exper- 
iments suggest that the proposed method works very well for point-based calculi, but 
not so well for region-based calculi. In particular, we established, as far as we know, 
for the first time the correct CT for INDU, and confirmed the validity of the algorithm 
reported for the OVTZA calculi lfT4l . Our method can be easily integrated into exist- 
ing qualitative solvers e.g. SparQ jS) or GQR [9]. This provides a partial answer to the 
challenge proposed in 1 12) . 

Recently, Wolter proposes (in an upcoming article 1231 ) to derive composition tables 
by solving systems of polynomial (in)equations over the reals. This approach works 
well for several point-based calculi, but not always generates the complete composition 
table. 

Our method relies on the assumption that the qualitative calculus has a small 'discre- 
tised' 3-complete subdomain. All calculi considered in this paper satisfy this property. 
It is still open whether all interesting calculi appeared in the literature satisfy this prop- 
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erty. Future work will also discuss the applications of our method for reasoning with a 
customised composition table. 
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